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Computation via Direct Manipulation 



DONALD A. NORMAN and EDWIN L. HUTCHINS 



INTRODUCTION 

This is the final progress report for contract 
N00014-85-C-0133, NR 667-541, Computa- 
tion via Direct Manipulation: Cognitive 
Science Construction Methods for Compu- 
tational Systems, Expert Assistants, and 
Graphical Direction of Complex Systems, 
with the Personnel and Training Research 
Programs of the Office of Naval Research 
(also known, informally, as "the Bridges 
Contract"). This contract was performed 
jointly by members of the Institute for Cog- 
nitive Science at the University of Califor- 
nia, San Diego (UCSD) and members of the 
Intelligent Systems Group of the Navy Per- 
sonnel Research and Development Center 
(NPRDC). Key members of the research 
team were Donald Norman and David 
Owen from UCSD, Jim Hollan, Ed Hutch- 
ins, and Miriam Schustack from NPRDC, 
and Colleen Seifert, an Office of Naval 
Technology and American Society for Engi- 
neering Education Postdoctoral Fellow. 
The total research group over the course of 



the contract period consisted of the follow- 
ing individuals: 

Larry Bierma (NPRDC) 

Bill Gaver (UCSD and Apple Computer) 

MSchael Goeller (UCSD) 

Jim Hollan (NPRDC: resigned in January 
1987 to become technical director of the 
Human Interface Program at the Micro- 
electronics and Computer Technology 
Corporation [MCC; Austin, Texas]) 

Ed Hutchins (NPRDC, then UCSD) 

Masumi Ishikawa (UCSD, visiting from 
MITI Electrotechnical Laboratories, Ja- 
pan). 

Tim McCandless (NPRDC: resigned in Jan- 
uary 1987 to join the Human Interface 
Program at MCC) 

Barbara Morris (UCSD) 

Donald Norman (UCSD) 

Mark Rosenstein (NPRDC: resigned in Jan- 
uary 1987 to join the Human Interface 
Program at MCC) 

David Owen (UCSD) 

Miriam Schustack (NPRDC) 
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Colleen Seifert (Office of Naval Technology 
and American Society for Engineering 
Education Postdoctoral Fellow at 
NPRDC) 

Ron Stanonik (NPRDC) 

Hank Strub (UCSD) 

Mark Wallen (UCSD) 

David Wargo (UCSD) 

Louie Weitzman (NPRDC: resigned in January 
1987 to join the Human Interface Program at 
MCC) 

Larry West (UCSD) 

Melissa Monty Whittaker (UCSD and Xerox) 
Jiajie Zhang (UCSD) 

DIRECT MANIPULATION 
INTERFACES 

Computers are clearly the most powerful 
and most flexible information processing 
tools ever. Given the dominant perspec- 
tive in cognitive science that human cogni- 
tion is best thought of as a form of infor- 
mation processing or computation, it 
seems obvious that computers have a 
greater potential than any other technology 
to enhance or augment human cognitive 
capabilities. But the problem of getting the 
human information processing system to 
work with the machine has not been easily 
solved. Interacting with computers often 
seems difficult and frustrating. Traditional- 
ly, in order to get a computer to do any- 
thing useful, one needed to learn a lan- 
guage in which the computer could be 
given instructions. 

In recent years, primarily in the wake of 
the advent of new input/output technolo- 
gies, a vaguely defined set of characteristics 
parading under the banner of "direct ma- 
nipulation" (Shneiderman, 1982) promised 
to make the computational power of com- 
puters more easily accessible. We set out to 
discover whether and how this might be 
true. We started off with a general analysis 
of the characteristics of direct manipulation 
(DM) interfaces (see Hutchirs, Hollan, & 
Norman, 1986), and this study led us to a 



series of studies aimed at understanding 
how computers can augment human cog- 
nitive abilities. 

One of the most important products of 
our research has been a new understand- 
ing of the context that surrounds our origi- 
nal goals. As our understanding of this 
context grew, topics that had not seemed 
closely related to the issues of DM began to 
seem very relevant indeed. The theme ex- 
pressed in the formal title of the contract 
— direct manipulation — emphasizes the 
central focus: the study of tools that might 
be easier to understand and easier to use 
than conventional ones. But as we pur- 
sued this analysis, we discovered that to do 
the study properly we needed to under- 
stand the wider range of issues surround- 
ing the ways that people perform their 
tasks. This research project has therefore 
broadened to consider a number of the 
overall issues relevant to the study of hu- 
man action and the interactions between 
people and designed artifacts. This has led 
to a number of related studies: 

• The experimental comparison of sys- 
tems that have different properties. This 
was an experimental analysis of the dif- 
ference in acquisition and performance 
of these two different styles of interac- 
tion with two systems for creating draw- 
ings that were formally equivalent in 
power (Schustack, in preparation). 

• The development of a direct manipula- 
tion system for the analysis of statistical 
data. This progrr riming development 
and analytical study allowed us to de- 
velop the implications of such systems 
on behavior (Owen, 1987, 1988). 

• Studies of the role cf naturalistic 
sound. Systems that are "direct" pro- 
vide rich and immediate feedback 
about the actions and system state. 
Sound can provide information about 
the nonvisible aspects of the interaction 
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and of the workings of the system. But 
for the sound to provide a rich source 
of information in a way that is readily 
interpretable, we discovered that it 
must be "naturalistic/' related in natu- 
ral ways to the kind of physical interac- 
tions users expect from their conceptual 
models of the system (Gaver, 1986, 
1988). 

• The development of design rules. The 
various studies of this research contract, 
combined with our earlier studies of er- 
rors in human action (work supported 
by previous ONR contracts), led to the 
recent book The Psychology of Everyday 
Things, which provides a summary of 
the issues and suggestions for design 
(Norman, 1988). 

• The study of socially distributed cogni- 
tion. One new important direction of 
research has been developed through 
the study of naturally situated cognition 
and, in particular, the study of the inter- 
actions and cognitions of the members 
of navigation teams aboard large navy 
ships (Hutchins, in press; Seifert & 
Hutchins, 1988). The computations un- 
dertaken by a navigation team are so 
completely embedded in the manipula- 
tion of computational tools that even 
the participants are often unaware of 
the fact that they are doing computa- 
tions: The specialized navigational in- 
struments provide a setting in which 
all the computation happens outside 
the people who perform it. 

These studies combine to form a cohe- 
sive attack on the understanding of four es- 
sential aspects of performance on tasks: 

1. Detailed analyses of the tasks. 

2 Analyses of the individuals, including 
their information processing structures, 
their knowledge structures, and the 



kinds of conceptual and mental models 
that they form. 

3. The role of social interaction in the per- 
formance of cooperative work, especial- 
ly how social groups share responsibili- 
ties in the performance of a task, 
including the fact that many of the in- 
dividuals may have only limited un- 
derstanding of the complete task do- 
main and that training and error 
correction are often integral parts of so- 
cial task performance, 3*>d help dictate a 
number of the ways in which the task is 
structured and performed. 

4. The study of the tools themselves, those 
artifacts that aid human cognition. 

The four topics provide the theme for 
the analyses presented in this report. In 
this report, rather than provide a simple 
review of the published papers, books, and 
technical reports that have come from this 
project, we provide a new synthesis. This 
report, therefore, is more than just a sum- 
mary: It is a new technical formulation of 
the issues, one that we believe provides 
promise for the understanding of existing 
intelligent systems, suggestions for the de- 
velopment of new systems, and a frame- 
work for future research efforts. 

The Gulfs of Execution and Evaluation 

In doing a task with an artifact, whether it 
be computer or some simpler technology, 
we can identify several aspects of the per- 
formance that affect the way in which the 
task gets done. When a person does an ac- 
tion, there are two major aspects — the ex- 
ecution side of the action and the evalua- 
tion of the resulting world-state. In our 
earlier paper (Hutchins, Hollan, & Nor- 
man, 1986), we described two gulfs that lie 
between the user and the artifact. First con- 
sider the gulf of execution, which lies be- 
tween the user's goals and the possible 
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actions on the interface. In order to use the 
device, the user must span this gulf by de- 
termining which actions on the interface 
will accomplish the current goals and by 
performing those actions. The task of figur- 
ing out what to do and then doing it takes 
effort. If it takes a lot of effort, we say the 
gulf is wide. The other gulf is the gulf of 
evaluation. It is spanned by perceiving, in- 
terpreting, and evaluating the state of the 
world and comparing it to the desired or 
expected (goal) state (Figure 1). We argued 
that one of the most critical issues in inter- 



Goals 
What we 
want to happen 




Execution 
What we do 
to the world 



vtugulf 

of 
< E?&Ciitwn 



Evaluation 
Comparing what 
happened with what 
we wanted to happen 




IRtQulf 
of 

'Evaluation 



FIGURE 1. The gulfs of execution and evaluation. 
Each gulf is unidirectional. The gulf of execution ex- 
tends from user goals to system state; the gulf of eval- 
uation extends from system state to user goals. Every 
expression in the interface language has a meaning 
and a form. Semantic distance reflects the relation- 
ship between the user f s intentions and the meanings 
of expressions in the interface language for both in- 
put and output. Referential distance reflects the rela- 
tionship between the physical form of the expression 
and its meaning. The easier it is to get from the form 
of the expression to meaning, the smaller the referen- 
tial distance. 



face design is the minimization of effort re- 
quired to bridge these gulfs. That is, we 
would like to make the gulfs narrow. But 
what is it that makes them wide? 

Each gulf has two components of dis- 
tance: semantic distance and referential dis- 
tance. 1 

Semantic distance is the relationship 
between the user's intentions and the 
meanings of the expressions that are possi- 
ble in the interface language. It refers to 
how well the interface language expresses 
the user's intentions. Can the user do what 
is intended with simple expressions or are 
complex chains of operations required? 
High-level programming languages can be 
seen as attempts to reduce semantic dis 
tance. The same argument applies to the 
evaluation side of action. Here, semantic 
distance refers to the amount of difference 
between the interpretation of the informa- 
tion available from the environment and 
the information needed to evaluate the en- 
vironment. 

Referential distance refers to the differ- 
ence between the user's understanding of 
the meaning of an expression and the 
user's understanding of its form. Basically, 
when one decides to do a particular action 
sequence (that is, when one forms an in- 
tention to act), before the action can be 
done, its specific manner — its form — 
must be determined. To specify an action is 
to map from its meaning to its form. 
When the meaning and form are similar, 
the mapping can be simple (analogical) and 
the perceived referential distance is small. 

When the form of an expression is un- 
like its meaning, the mapping is complex. 
Symbolic interfaces, for example, are typi- 
cally high in referential distance because 
the relationships between the meaning of 



1 in the earlier work (Hutchins, Hollan, & Norman, 
1986), referential distance was called artxculatoiy dis- 
tance. The name articulators/ is unfortunate, however, 

for reasons that should become clear in the following 
discussion. 
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the expression (a symbolic expressior) and 
the form by which it must be executed is ar- 
bitrary. The user must do the work to 
translate meaning to form. This work is 
the construction and maintenance of struc- 
ture that spans the referential distance. 

The same problem applies in reverse 
on the evaluation side of the action. Here, 
referential distance refers to the difference 
between the forms available in the envi- 
ronment and the meanings that are to be 
extracted from them. When form and 
meaning are similar (such as when an up- 
ward-moving line is to be interpreted as an 
increase in the value of the variable of in- 
terest), referential distance is small. When 
form and meaning are dissimilar or unre- 
lated (as when the change in value must be 
computed from a series of numbers), refe- 
rential distance is large. 

Simple mappings between form and 
meaning occur when the form of the ex- 
pression is similar to the meaning of the 
expression or when the structural relations 
among forms are similar to the structural 
relations among meanings. Consider the 
meanings of the words left and right. Now 
imagine two sets of expressions for these 
meanings, first, the words themselves — 
left and right — and second, some symbols 
— «■ and . The first set, the words, has 
high referential distance because the form 
of the symbol left has little to do with the 
direction being indicated except by the arbi- 
trary convention of the translation of let- 
ters to words and words to meanings. We 
think the relationship is "natural" only be- 
cause we have learned it so well. But note 
what happens when we change languages 
to, say, Spanish or French: the physical 
form of the symbols changes dramatically 
(from left to izquierdo or ganch) while the 
meaning remains unchanged. 

Now consider the pictorial symbols 
and + . These symbols have a small refe- 
rential distance. Notice that this is not be< 
cause the symbol *■ has a structure similar 
to the meaning left. In fact, the structure of 



*■ does not inherently possess any mean- 
ing: Our understanding of it is entirely 
conventional and probably had to be 
taught upon first encounter. The claim to 
meaning similarity is "second order": The 
elements of the contrast set <■ and ■+ hav 
a structural relationship to each other that 
is similar to the meaning relationship be- 
tween the words left and right. It is the rela- 
tionships that are similar. What matters is 
not the mappings between individual sym- 
bols and individual meanings (because 
that may be arbitrary), but rather the map- 
pings between sets of symbols and sets of 
meanings. 

We can illustrate the differences be- 
tween semantic and referential distance by 
considering the results of an experimental 
comparison of two interfaces that have dif- 
ferent interaction styles. 

An Empirical Exploration of DM 

In an empirical study of DM interfaces 
(Schustack, 19xx), users were given a task 
to perform using two drawing programs 
that were very different in style but with 
almost identical functionality. One of the 
interfaces had many features that should 
help reduce referential distance: For exam- 
ple, it was WYSIWYG (what you see is 
what you get), it provided inst ntaneous 
feedback on each action as it was per- 
formed, and all possible actions were con- 
tinuously visually represented. It was 
mouse-driven, and it allowed the user to 
move around the picture space by analo- 
gous movements on the mouse pad. The 
other interface had none of these character- 
istics: Users wrote programming scripts in 
symbolic language, relying on their memo- 
ry or the documentation to generate the 
elements and the syntax, and getting feed- 
back on the syntactic acceptability of the en- 
tire script (and the resulting picture, if the 
syntax is all correct) only after moving 
from script-editing mode to execution 
mode. 
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These interface differences affected the 
nature of the errors that users made, the 
relative difficulty of the various drawings, 
how long it took users to reach a criterion 
level of performance, and how well they 
exploited the more s sphisticated features of 
the programs. 

The visually oriented interface allowed 
users to complete the pictures significantly 
faster than the command-languags style in- 
terface. This was a very robust finding that 
held up over the set of pictures as a whole 
Ohat is, the time to complete the series was 
significantly shorter for the visual-style in- 
teiface), and it also tended to be true of the 
individual pictures. This global advantage 
in completion time occurred even if the 
time spent in initial document reading and 
practicing is excluded. Even though the us- 
ers of the command-style interface spent 
more time on these preliminary activities 
before they even started copying the first 
picture, the copying task itself still took 
them substantially longer than it took us- 
ers of the visual-style interface. 

The first, very simple picture (which 
was 2 circle in the middle of the picture 
space) took users of the command-style in- 
terface about a half hour to complete, while 
the visual-interface users were finished in 
close to five minutes. Just getting enough 
control over the command-style interface 
to make it draw the simplest picture im- 
poses a substantial burden on the new user. 
There was no such "start-up cost" apparent 
for users of the more visual interface — 
they showed no speed-up from the first to 
the second picture, while the command- 
style interface useis brought their times 
down from the half hour it took for the 
first picture to about 10 minutes for the 
c ^cond. 

But, even limiting the focus to simple 
quantitative measures, the situation is not 
all one«i,:-i8d. Some of the pictures had 
characteristics that made them more quick- 
ly accomplished in one interface than in 
toe ether. In terms of Mme-to-completion, 



this effect not only reduced the speed bene- 
fit of the visual-style interface for certain 
pictures, but in a few cases reversed it to a 
speed benefit for the command-style inter- 
face. One of the consequences of the stylis- 
tic differences between the interfaces is that 
there are certain tasks that are auite well 
suited to the features of one interface and 
less well suited to the other. For example, 
the placement of labels (alphabetic strings) 
in a picture can be done quite easily in the 
command-style interface but is more cum- 
bersome in the visual-style interface, For 
this task, the command-style interface 
clearly had less semantic distance than the 
visual-style interface. 

The performances with the two inter- 
face also differed in tne quality of the pic- 
tures prod jed (evaluated by rating eight 
independent dimensions of each copy pro- 
duced). By and large, the copies produced 
were extremely good reproductions of the 
originals in an absolute sense. The pictures 
got an overall average rating of over 4.5 on 
a 5-point scale (where 5 represented perfect 
-eproduction). There were two major char- 
acteristics of the quality of the pictures that 
are notable. First, the users of the com- 
mand-style interface were overall signifi- 
cantly more accurate in their pictures, and 
were more accurate on every rated dimen- 
sion on which there were any differences. 
The characteristics of the command-style 
interface made some aspects of fine-tuning 
fairly simple and straightforward (al- 
though not necessarily fast); in the visual- 
style interface users ran a substantial risk of 
ruining whatever version they already had 
in trying to make small changes. Thus, the 
command-style users were more willing to 
keep working until they were completely 
satisfied, whereas the visual-style interface 
users stopped when the picture was "good 
enough" (which was objectively quite good 
in most cases). Second, echoing the finding 
for L » completion times above, there were 
certain pictures that were more suited to 
the strengths of each interface. Pictures 
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with features that were differentially easy 
to accomplish in the two interfaces showed 
differences between the interfaces in their 
accuracy (in both directions). 

These results illustrate the importance 
of realizing that there is more to direct ma- 
nipulation than short referential distance. 
On the one hand, the visual-style interface 
had low referential distance, but the prob- 
lems subjects had in cleaning up rough 
drawings and placing text labels indicate it 
had high semantic distance with respect to 
some aspects of the tasks. The command- 
style interface, on the other hand, appeared 
to have high referential distance — the 
meanings of the commands were arbitrari- 
ly related to their form — but low semantic 
distance. With the visual-style interface, it 
was easy to specify something to do, but of- 
ten difficult to specify just exactly what one 
wanted. With the command-style interface 
it was possible to specify exactly what one 
wanted, but difficult to generate the expres- 
sion that would do it. 



Everyday Listening and Auditory Icons 2 

An importan' method for enhancing di- 
rectness is through appropriate feedback. 
This can be done visually or auditorily. 
Little is known about the role of sound in 
providing this kind of information. Part of 
our research efforts have examined the 
role of sound. The studies, conducted by 
Gaver, represent an attempt to understand 
how people listen to the world, and how 
such an understanding can help in devel- 
oping auditory interfaces for computers. 

Everyday Listening 

There can be little doubt that sounds con- 
vey a great deal of information to listeners 



2 This section consists of edited excerpts from Wil- 
liam W. Gaver's doctoral dissertation, "Everyday Lis- 
tening and Auditory Icons." 
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about events in the world. Yet relatively 
little attention has been given to how this 
happens. Instead, most studies of sound and 
hearing have been attempts to understand 
either how music is perceived, how the au- 
ditory system transduces sound, or how to 
measure and reduce environmental noiso. 
There are, of course, notable exceptions to 
this rule (e.g., work on auditory streaming, 
localization, and speech), but on the whole, 
music, transduction, and noise seem to de- 
fine the major interests in audition. 

One large set of our studies were an at- 
tempt to define and explore a new way of 
understanding how people hear. In this re- 
search, we were concerned with under- 
standing everyday listening, the act of gain- 
ing information about events in the world 
by listening to the sounds they make. He 
takes as a basic hypothesis that people do 
listen to events, and that they can obtain 
information about various aspects of these 
events by listening to them. From this be- 
ginning, he goes on to address two primary 
questions about these abilities: What do 
we hear?, and How do we hear it? 

Work on a basic understanding of eve- 
ryday listening is complementary to the de- 
velopment of auditory icons. On the one 
hand, the attributes of sonic events that 
may be mapped to attributes of events in 
the computer are suggested by an under- 
standing of what we hear; and the ways to 
convey these attributes clearly to users are 
precisely the concern of research on how 
we hear. On the other hand, work on au- 
ditory interfaces is invaluable in testing the 
results of basic research in a more natural 
setting, and helps generate intuitions about 
everyday listening. Pursuing the basic and 
applied research together has proved to be 
a valuable course. 

Auditory Icons 

Though research on everyday listening is 
clearly only in its very early stages, already 
an application of this perspective on 
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audition has proved to be quite useful. This 
application involves using everyday 
sounds to provide information to comput- 
er users about events in the interface. 
When sounds are used in this way, the re- 
sults are called auditory icons because of 
their similarities to visual icons. 

Auditory icons hav ebeen described in 
three papers that discuss his work on audi- 
tory icons. First is the paper that intro- 
duced this idea (Gaver, 1986), and which 
lays the framework for further research in 
this area. Second is a technical report writ- 
ten recently for Apple Computer, Inc. (Gav- 
er, 1988), which is a survey of current tech- 
niques for using sound, and which 
explores the issues involved in creating au- 
ditory icons in more detail. Last is a paper 
that describes the SonicFinder™, a proto- 
type auditory interface developed by Gaver 
at Apple Computer, Inc. (Gaver, in press). 
This interface illustrates the use of auditory 
icons in an actual system. 

These three papers show the develop- 
ment of auditory icons. In addition, the 
Apple technical report provides a broader 
view of how sounds can be used Inter- 
faces, and the paper on the SonicFinder 
makes the ideas more concrete by provid- 
ing specific examples of how auditory icons 
may be constructed. 

Everyday Listening and Auditory Icons 

A number of people have recently been ex- 
ploring the possibilities of using nonspeech 
audio in interfaces (e.g., Bly, 1982; Mansur, 
1984; Mezrich, Frysinger & Slivjanovski, 
1984; Morrison & Lunney, 1985; Sumikawa, 
Blattner & Greenberg, in press). But our 
strategy behind auditory icons is considera- 
bly different from those used by other re- 
searchers, and has several advantages as a 
way of conveying information from com- 
puters. 

The crux of the differences between au- 
ditory icons and other methods of using 
sound is that auditory icons are based on 
an understanding of everyday listening, 



while other techniques rely on musical 
manipulations of sound to convey data. 
This has crucial effects on the kinds of 
mappings between sounds and informa- 
tion that can be created using these strate- 
gies. Because the attributes of everyday lis- 
tening are those of sound-producing 
events, a nomic or iconic mapping can 
hold between auditory icons and events in 
the computer. Systems that rely on musi- 
cal manipulations of sound can only em- 
ploy metaphorical or symbolic mappings. 
Because of this difference, auditory icons 
are likely to be more intuitively obvious to 
users, more easily learned and remem- 
bered, and more likely to increase users' 
feelings of working directly in the task do- 
main, rather than on a computer. 

It is important to note here that the 
promise of auditory icons is entirely based 
on the way of understanding "everyday lis- 
tening." That such a new area of research 
should prove so fruitful is heartening: 
The appeal of auditory icons must be taken 
as support for the notion of everyday lis- 
tening. 

In addition, work on everyday listening 
and auditory icons is intimately related. 
Auditory icons depend on an understand- 
ing of everyday listening so that computer 
events can be reliably and usefully mapped 
to sound. Conversely, work on auditory ic- 
ons can help provide hypotheses and evi- 
dence for research on everyday listening. 
Finding useful auditory icons often makes 
apparent interesting issues about what 
events people hear, issues that may lead to 
productive empirical examinations. In 
this way, work on auditory icons supple- 
ments studies of physics, protocol studies, 
and more rigorous experimental work in 
helping to understand everyday listening. 

COGNITIVE ARTIFACTS 

As we studied direct manipulation systems 
and contrasted them with command- 
language systems, we became aware that 
the questions really focused upon a more 
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general concern: the manner by which peo- 
ple interact with the world. This takes on 
special concern with computer systems, for 
with the computer, there is only a second- 
order interaction with the world. The com- 
puter contains a representation of objects 
and information in the world and the in- 
teraction is done upon this representation. 
The indirectness leads to the gulfs that we 
identified in our earlier studies. And the 
indirectness leads to both the power and 
the difficulty of the interaction. The major 
feature of a direct manipulation interface is 
its attempt to minimize the indirectness by 
minimizing the distinction between the 
appearance and requirements of the repre- 
senting world of the computer and the ex- 
ternal, physical world — the represented 
world. Still, direct manipulation interfaces 
are really only one example of a more gen- 
eral issue: the study of artificial systems in 
general and, especially, the characteristics 
of artifacts that aid human cognition. 

Most artifacts aid physical requirements 
and enhance physical abilities. Some arti- 
facts, however, increase our cognitive abili- 
ties: We use the term "cognitive artifacts" 
to refer to these objects. Although cogni- 
tive artifacts are less numerous and less 
well studied than physical artifacts, they 
play an increasingly important role in our 
lives. Maps, drawings, blueprints, lists, 
tables, calendars, books, slide rules, calcula- 
tors: These are all examples of cognitive ar- 
tifacts. Computers are a special class of arti- 
fact and they deserve special treatment, for 
they can do things that are difficult or im- 
possible for other artifacts. Still, they are ar- 
tifacts, and an understanding of the psy- 
chology of our interaction with things 
must be relevant to our interactions with 
computerized things. 

Computational Power in Artifacts 

The computer can function as a powerful 
and independent information processing 
device. As a result, we often concentrate 
upon its power to do tasks previously done 



by humans, and then view the problem of 
human computer interaction (HCI) as one 
of specifying a division of labor between 
human and computer and designing a 
communication protocol that can be used 
to coordinate the efforts of the two partici- 
pants. This is, of course, an important pos- 
sibility for HCI, but it is not the only one, 
and assuming it prematurely may lead one 
to overlook some other important aspects 
of interactions with artifacts in general and 
computers in particular. 

Our view of cognition assumes neither 
that cognition happens all "in the head" 
nor that useful cognitive artifacts necessar- 
ily achieve their results by augmenting or 
amplifying the cognitive abilities of users. 
Instead, we look for the power of cognitive 
artifacts in the interaction of what is inside 
the person and that which is outside the 
person. 

Checklist as Cognitive Artifact 

One simple (noncomputer) artifact that we 
have examined in some detail is the sim- 
ple checklist, used to organize tasks when- 
ever it is essential that a list of actions all 
be performed, oftentimes in particular or- 
der (Hutchins, 1986). The checklist organiz- 
es the behavior of the task performer in a 
way that may not be possible for the unaid- 
ed performer. In order to use a checklist as 
a guide to action, the task performer must 
coordinate with both the checklist and the 
environment in which the actions are to 
be taken. 

Achieving coordination with the 
checklist requires the actor to invoke proce- 
dures for the use of the checklist. These in- 
clude reading skills and a strategy of se- 
quential execution that permits the task 
performer to ensure that the steps will be 
done in the correct order and that each step 
will be done once and only once. The fixed 
linear structure of the checklist permits the 
user to accomplish this by simply keeping 
track of an index that indicates the first un- 
executed (or last executed) item. 
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While the task performer is following 
the checklist, high-level control of task- 
related behavior is given to the structure of 
the artifact. It might seem that the actor al- 
ternates coordinating with the structure of 
the checklist and coordinating with the 
structure of the task world. However, the 
coordination with the two media is in fact 
simultaneous to the extent that under- 
standing a step in the description may de- 
pend upon understanding the state of the 
world in which it is to be carried out. The 
experience of the meanings of the descrip- 
tions of the steps incorporates experience of 
the task v/orld, and the doing of the actions 
in the task world incorporates the experi- 
ence of the meaning of the task steps. The 
importance of this is that in this mediated 
performance, the person provides continu- 
ous coordination among several structured 
media, some internal, some external. 

It is easy to show that a novice task per- 
former has no internal representation of 
thp entire checklist. The large-scale struc- 
ture of the task performance is in the or- 
ganization of the checklist, not in the mind 
of the task performer. With practice, of 
course, the novice may learn the procedure 
and the large-scale structure of the task 
may come to be represented in the mind of 
the performer. As was the case with the 
typewriter keyboard, internalizing such 
structure may lead to much more rapid 
and efficient performance. That, however, 
is another story. At this point in our argu- 
ment we wish only to illustrate the fact that 
people can make use of structure in their 
environment to transform their cognitive 
abilities without having any enduring in- 
ternal representation of that structure. 

Problem Solving as Re-Representation 

Cognitive artifacts, whether computerized 
or not, are always vehicles for representa- 
tions of the problems they are used to 
solve. The way a problem is represented 
can radically change what is required of the 



problem solver and a good artifact will, 
therefore, use representations that match 
those of their users. In fact, Simon has sug- 
gested that "solving a problem simply 
means representing it so as to make the so- 
lution transparent" (Simon, 1981, p. 153). 
Many artifacts can serve as representation- 
al transformers, taking the representation 
of the problem from one format into a for- 
mat better suited for use by people. 2 



Five Ways to Do Distance/Rate/Tirne 
Problems 

An example from Hutchins's (in press) 
study of ship navigation demonstrates 
how the structure of representational arti- 
facts can change the cognitive activities re- 
quired to do a task. Imagine a navigator 
who has just plotted a position fix and 
needs to compute the ship's speed based on 
the distance the ship has moved in the in- 
terval of time that elapsed between the cur- 
rent fix and the previous one. This is a 
standard distance/rate/time problem. For 
this particular exercise assume that the two 
fix positions are 1,500 yards apart and that 3 
minutes of time have elapsed between the 
fix observations: What is the ship's speed 
(in knots)? 

Consider the cognition required under 
five different conditions; 

Condition 1: The performer has the fol- 
lowing resources: paper and pencil, 
knowledge of algebra, knowledge of 
arithmetic, knowledge that there are 
2,000 yards in a nautical mile and 60 
minutes in an hour, and knowledge of 
the equation D = RT. 



3 Hill (1988) argues that artificial intelligence serves 
primarily as the developer of representational vehi- 
cles that permit the human intellect to express itself 
in ways more powerful than previous methods; more 
powerful in the sense that they match better human 
capabilities and structures. 
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Condition 2: The task performer has the 
same resources as in Condition 1, except 
that instead of a paper and pencil, the 
task performer has a four-function 
pocket calculator. 

Condition 3: The task performer has a 
Ihree-scale nomogram (see Figure 2) 
and knows how to use it. 

Condition 4: The task performer has a 
nautical slide-rule (see Figure 3) and 
knows how to use it. 

Condition 5: The task performer has no 
material implements at all, but knows 
how to use what navigators call the "3 
minute" rule. 

In Condition 1 (paper and pencil), the 
task performer will first have to use the 
knowledge of algebra to manipulate the 
formula D=RT to the form R = D/T so ihat 
rate can be solved for directly from the giv- 
en values of D and T. The distance in yards 
will have to be converted to the equivalent 
number of miles (requiring knowledge of 
the number of yards in a mile and the rele- 
vant arithmetic). The time in minutes will 
have to be converted to the equivalent 
number of hours (requiring knowledge of 
the number of minutes in an hour and, 
again, arithmetic). The distance measure 
must be divided by the time measure (re- 
quiring arithmetic again) to get the rate. Of 
course, these things can be done in a differ- 
ent order; for example, the division could 
come before either of the unit conversions, 
or between them, but in any case all these 
things must be done at some point in order 
to solve the problem. 
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D = RT 
FUD/T 

1500 yards = 1500/2000 = 3/4 miles 
3 minutes = 3/60 = 1/20 hours 
R = (3/4) 20 = 15 knots 
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FIGURE 2. The time/distance/speed nomogram. 
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FIGURE 3. Navigation slide rule. 



Although the arithmetic and algebraic 
skills required to solve this problem are 
reasonably simple, they would tax the abili- 
ties of many navigation practitioners in the 
navy. The problem is not that the arithme- 
tic is difficult, but rather that it is Necessary 
to figure out what to do and how to fit the 
intermediate steps together while in the 
real, operational setting of time and social 
pressures and task demands. One may be 
perfectly capable of doing every one of the 
component subtasks in this problem, but 



fail completely for lack of ability to orga- 
nize and coordinate the various parts of 
the solution with each other. 

In Condition 2 (the calculator), the pro- 
* cedures for doing the arithmetic operations 
of division and multiplication are restruc- 
tured so that instead of decomposing the 
problem to a set of operations on paper, 
values are keyed into the calculator and 
buttons pushed. Depending upon the order 
in which the steps are taken, it may be ne- 
cessary to remember a previous result and 
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enter it into a later operation, after other 
operations have intervened. This version 
of the task would probably also tax the abil- 
ities of many navigation practitioners, be- 
cause the hard part is not doing the arith- 
metic but in deciding how to coordinate 
the arithmetic operations with each other. 
The calculator gives no support for that 
part of the task. 

Conditions 1 and 2 (paper and pencil 
and calculator) are alike in that they use 
general computational knowledge which 
gives little help in structuring the actions. 
Because of this, the computation is com- 
plex, and it contains many steps, especially 
if we count as a step the writing of each 
symbol or each key press of the calculator. 

Now consider Condition 3 (the nomo- 
gram). A nomogram is a printed form con- 
taining scales, usually linear or logarith- 
mic, so arranged so that a straight line 
drawn through the known values on the 
scales intersects the other scales at the solu- 
tion points. In the case of this navigation 
problem, there are three logarithmic scales: 
one for rate (usually calibrated in knots), 
one for distance (usually calibrated in both 
yards and nautical miles), and one for time 
(usually calibrated in seconds, minutes, 
and hours). A straight line through any 
two known points on any two scales inter- 
sects the third scale at whatever value 
solves the D = RT equation. Thus, to solve 
the particular problem under discussion, 
one simply makes marks at 3 minutes on 
the time scale and at 1500 yards on the dis- 
tance scale. Then one draws a line through 
those two marks with a straightedge: The 
answer of 15 knots can then be read on the 
speed scale where the drawn line intersects 
the scale. This procedure is easy for every- 
one, and well within the limits for naviga- 
tion practitioners in the navy. 

Of course, Condition 3 has an unfair ad- 
vantage over Conditions 1 and 2. The hard 
work was done in making the nomogram 
in the first place. But that is part of the 
point. This is a very frequently occurring 



problem, and this tool is designed specifi- 
cally to make its solution easy. 

Condition 4 (the nautical slide rule) 
provides an analysis similar to that of Con- 
dition 3: A slide rule is simply a general- 
ized, mechanical version of the nomo- 
gram, where the movement of the scales 
and of the sliding hair line corresponds to 
the setting of the points and the drawing of 
the line on the nomogram. In this case, 
one aligns the distance index with the de- 
sired distance on the distance scale, aligns 
the elapsed time index with the desired 
time on the time scale, and then the speed 
index points to the speed in knots. 

In both the nomogram and the slide 
rule, having the scales in the units nor- 
mally encountered eliminates the need to 
convert one kind of unit into another, but 
much more important is the fact that these 
two conditions eliminate the need for any 
knowledge of algebra. The nomogram and 
slide rule transform the task from one of 
computational planning — figuring out 
what to divide by what — to one of simple 
manipulation. In the first two conditions, 
knowledge of the numbers and the equa- 
tions provide little assistance in knowing 
exactly what to do. When using the nomo- 
gram or the slide rule, the structure of the 
artifacts themselves make it almost impos- 
sible to fail: Just enter the knowledge 
known into the device and it provides the 
value for the missing term. The relations D 
= RT,R = D/T, and T = D/R are built into 
the structure of the nomogram and slide- 
rule. 

With the proper artifact, although the 
task performer still needs to know some- 
thing, the knowledge that is required is less 
complicated and less general than that re- 
quired with just paper and pencil or calcu- 
lator. What needs to be done can be in- 
ferred from the structure of the artifacts 
themselves. They constrain the task so that 
many errors that are easy (and frequent) 
with pencil and paper or calculator are not 
even possible with the artifact. Much of the 
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computation is done by the tool (or by the 
designer of the tool). The person can get by 
doing less because the tool does more. 

But now consider Condition 5, the use 
of a specialized internal artifact. A general 
computation rule (the 3-minute rule) is 
that the number of hundreds of yards a 
ship travels in 3 minutes is its speed in 
knots. 4 In our problem the ship traveled 
1,500 yards in 3 minutes, so we instantly 
"see" that its speed is 15 knots. The naviga- 
tor need only remove the last two digits 
from the distance in miles: Voila! 15. 

Now the reader may really cry "foul 
play." 'This is a special example with spe- 
cial, made-up numbers that permit this 
trick to be used," you might say. True, but 
because the rule is so simple, navigators try 
to determine distances in yards every 3 
minutes. Navigators are capable of measur- 
ing their distance every 2 minutes or even 
ever) minute, but three minutes is more 
common, not because it meets the needs of 
the ship better than the other intervals, but 
because it meets them well enough, and it 
makes this computation so convenient. 5 

Artifacts — Amplifiers or Transformers 
of Cognition 

It has now become commonplace to speak 
of technology, especially information pro- 
cessing technology, as an amplifier of cog- 
nitive abilities. Cole and Griffin (1980) 
show, however, that the appearance of am- 



* One hundred yards is 1/20 of a nautical mile and 3 
minutes is 1/20 of an hour. So the r umber of hun- 
dred yards traveled in 3 minutes is the number of 
miles that will be traveled in 1 hour, which is the 
speed in knots. 

5 The nautical slide rule and nomogram are normal- 
ly only used when the ship is away from land and the 
fix intervals are much longer than 3 minutes. When 
the cycle is performed on the shorter intervals of 1 or 
2 minutes, speed is normally computed by conversion 
to the 3-minute standard. For example, if the ship 
travels 800 yards in 2 minutes, it would travel 1200 
yards in 3 minutes, so its speed is 12 knots. 



plification is an artifact of a commonly as- 
sumed perspective. When we concentrate 
on the product of the cognitive work, cul- 
tural technologies, from writing and math- 
ematics to the kinds of tools we have con- 
sidered here, appear to amplify the 
cognitive powers of their users. Using 
these tools, people can certainly do things 
they could not do without them. When we 
shift our focus to the process by which cog- 
nitive work is accomplished, however, we 
see something else: Cognitive abilities of 
unchanged power are reorganized in inter- 
action with the structure of the technology. 

There are two important things to no- 
tice about these alternative ways to do an 
algebraic task. First, the very existence of 
these five different ways is evidence of a lot 
of effort directed toward avoiding the use 
of algebraic reasoning, and arithmetic. Sec- 
ond, these tools and techniques permit the 
task performer to avoid doing algebra, rea- 
soning and arithmetic, replacing these ac- 
tivities with aligning indices on scales, or 
simply deleting zeros from numbers. 
These tools transform the task by mapping 
it into a representational medium in 
which the method or answer is apparent. 

Note that the required procedures may 
be trivial, but they are not obvious. One 
can scarcely imagine a simpler procedure 
than the application of the 3-minute rule, 
yet many who use it have no idea why it 
works and would never have discovered it 
on their own. We return to this point later. 

Let us now summarize the argument 
so far. The power of cognitive artifacts de- 
rives from the fact that they form or trans- 
form cognitive tasks. An artifact that trans- 
forms a task can radically change the kind 
of mental structure that is required to 
bridge the gap between user intentions and 
actions on the device interface (the world). 
They also create entirely new relationships 
to the world. A cognitive artifact that trans- 
forms tasks reduces semantic distance by 
simplifying what must be done to solve a 
problem. This is very clear in the case of 
navigation instruments that replace 
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algebraic and arithmetic operations with 
simple manipulations of external or inter- 
nal structure. Even the simple checklist re- 
duces the semantic distance for its user. 
Lacking the checklist, the novice must dis- 
cover the steps that need to be done and an 
order in which they can be applied. With 
the checklist, the task is transformed: read- 
ing and following instructions take the 
place of procedural reasoning. 

The Cottage Cheese Story 

In a study of weight watchers in Southern 
California, de la Rocha reports 6 the follow- 
ing strategy is employed by a weight watch- 
er. The menu for the meal calls for 2/3 of a 
cup of cottage cheese, but the dieter only 
wants to make 3/4 of the recipe amount. 
How can the dieter computer the proper 
amount of cottage cheese? In the case re- 
ported, the person filled a measuring cup 
up to the 2/3 mark with cottage cheese, 
dumped the measured cottage cheese on a 
plate, patted it out into a pancake, drew an 
X on the top that divided it into four "pie 
sections" of equal size, and then used three 
of the four sections. 

The recipe provides an initial set of 
symbolic constraints, but instead of operat- 
ing on the symbolic representation, the 
representation was interpreted into the 
physical medium of cottage cheese. This 
provided a mapping of the symbolic state- 
ment of the problem onto manual action 
in the physical world. In reviewing this 
and related examples of real world arith- 
metic reasoning, Resnick (1987) notes: 

They got reliable arithmetic results by 
treating the material they were working 
with as part of their calculation process, 
rather than by just operating on sym- 
bols, (p. 14) 



6 This account originally appeared in a chapter by 
Lave, Murtaugh, and de la Rocha (1984). 



To do the task symbolically, one would 
have to multiply the fractions together. 
One way to do that would be to notice that 
the 3 in the numerator of 3/4 cancels the 3 
in the denominator of 2/3, giving 2/4, 
which is easily transformed to 1/2. The op- 
erations are easy for one experienced with 
the relevant arithmetic, but if one just 
thinks about the problem it has the poten- 
tial to be difficult (taking 3/4 of 2/3 seems 
hard, something like, perhaps the taking of 
5/7 of 8/9). Moreover, the arithmetic pro- 
ceeds according to syntactic rules that re- 
spond to the form of the expression and 
have nothing whatever to do with cottage 
cheese. Only after the syntactic operations 
have been completed, is the new symbolic 
form, 1/2 cup, interpreted in the world of 
cottage cheese. It is the sensitivity to form 
rather than meaning and the suspension 
of interpretation during syntactic opera- 
tions that makes it a formal system. For- 
mal systems are thus liberated from the 
constraints of concrete reality. But the gain 
in power is at the expense of clarity. 

The theme of liberation from con- 
straints of concrete reality is prevalent in 
the philosophical and developmental liter- 
ature, and the history of Western civiliza- 
tion is full of demonstrations of the enor- 
mous power of formal systems. Yet, this 
poses a computational burden on people in 
the requirement for syntactic knowledge 
and mediation of the formal operations: 
We call this the burden of syntax. 

As observers and actors we may certain- 
ly intend what we do in the world and in- 
terpret the consequences. But the world it- 
self neither interprets nor intends 
consequences. We take the point of view of 
an observer, interpreting actions and states: 
The world is a system with neither repre- 
sentations nor interpretations — the world 
simply responds (this view is presented in 
expanded form in Gaver, 1988). Thus, al- 
though our actions have meaning to us 
and to other observers, they do not to the 
world. The notion of "the meaning of an 
expression'' is a linguistic notion. It implies 
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a reference gap — a relationship between 
one thing and some other thing that it 
"represents" or "stands for." The reference 
gap in turn implies an interpreter, an agent 
that can bridge the gap and make a map- 
ping from the expression that does the rep- 
resenting to the thing that is represented. 
This reference gap does not exist for events 
in the world. It occurs only when there are 
symbolic relations. This difference of per- 
spective provides two apparently incom- 
patible realms: a realm of action and objects 
and a realm of symbols. Computers pro- 
vide us a way to put these two together, but 
it is difficult to describe from the perspec- 
tive of either realm. 7 

From the perspective of the symbolic 
realm, we want the expressions in the in- 
terface language to be composed of the ob- 
jects and actions in the world of interest. 
From the perspective of the realm of ac- 
tions and objects, we want the objects and 
actions in the world to be the elements of 
the symbolic expressions that refer to them. 
The key to the sensation of directness is 
that technology permits the design of an in- 
terface-language/model-world that can ap- 
proximate this state of affairs to an extent 
impossible in any previous medium. 

By presenting the world of action as the 
interface language itself, we can collapse 
the symbolic reference gap. And if the mod- 
el world is properly designed, it can be the 
kind of artifact we sought. Consider a sys- 
tem like the Macintosh where files are re- 
moved by dragging icons that represent 
them to the trash can. From the point of 
view of our analysis, what is especially nice 
about this interface is that the physical con- 
straints of the world of file icons make cer- 
tain classes of syntactic errors impossible: It 



7 Gaver (1988) describes exactly this dichotomy, but 
in the context of distinguishing between cognitive and 
ecological (Gibsonian) views of psychology. Gaver's 
work provides a valuable step in the argument we 
present here. 



is not possible to try to remove a file that 
does not exist. 8 

The Design of Appropriately 
Constrained Artifacts 

The symbolic world is an artificial world. If 
syntax were the only constraint, the protec- 
tion against the impossible and the non- 
sensical is weak because it places a heavy 
burden on the knowledge of the user and 
the correctness of the operations. 

The weight watcher had two means to 
solve the problem — by manipulating 
symbols or by manipulating the cottage 
cheese. The culturally valued technique 
(taught in schools) is to do symbolic ma- 
nipulations on the symbolic representa- 
tions. The preferred technique of people is 
to manipulate the actual objects. 

Given this apparent conflict between 
cultural and individual preferences, there 
are at least two roads to take. One is to 
mechanize the manipulation of symbolic 
structures. That road leads to the comput- 
er. Another is to stay in the world of man- 
ual manipulation. This second road leads 
to a special class of computational artifacts 
in which the syntax of the represented do- 
main is built into the physical properties of 
the representing object. But how can we get 
syntactic constraints out of physical proper- 
ties? 

Locking the door to your car with the 
keys inside is an annoying error. One could 
imagine a checklist for leaving the car that 
could prevent this sequence error, but the.i 
the checklist would be a nuisance and 
would likely often be skipped. There is, 



8 Unfortunately, the collapse of the reference gap is 
never complete. There is still the mouse protocol and 
the key presses instead of getting our hands on the 
objects. Even touch screens leave a gulf. And the ob- 
jects of interest may or may not actually be the object 
of interest. What the object of interest is may depend 
upon the users goals ar any moment. 
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however, an infallible way to ensure that 
keys are not locked in the car. That is to 
make the car door so that it can only be 
locked with the key operated from the out- 
side. Norman (1988) called this type of 
scheme a "forcing function," for it provid- 
ed a physical constraint so that failure at 
one stage prevents the next step from hap- 
pening. 

Sequential dependencies among actions 
imply a kind of syntax of action. Doing the 
action in one order is a well-formed action, 
and violating sequential dependencies pro- 
duces a malformed action. But sequential 
relations are only one kind of relation that 
can be constrained. There are other sorts of 
syntax of action. Inserting a disk upside 
down or backwards in the disk drive is a 
"syntax" error that can be eliminated by de- 
signing disks and drives with physical con- 
straints that make only meaningful actions 
possible. Now let us see how to carry this 
method to the design of artifacts so that 
physical constraints preclude erroneous ac- 
tions. 

Intelligent artifacts are "representing 
worlds": That is, they form a representa- 
tion of some aspect of the external, physical 
world and do their operations upon this 
internal, representing world. If the artifact 
is to be readily usable, the constraints of the 
representing world must have a meaning- 
ful relationship to the constraints of the 
represented world. Furthermore, there 
must be a reason to do things in the artifact 
rather than directly in the world — why 
manipulate the representation if one can 
manipulate the real thing? In the case of 
mathematics, the reason is often that some 
important aspect of the represented world 
cannot be directly operated upon. In the 
case of the cottage cheese example, the ma- 
nipulations of the real object are messy, al- 
though still do-able: Other kinds of objects 
are not apt to be as susceptible to manipula- 
tion. In the case of computing the speed of 
the ship, no physical manipulation will do 



the trick — symbolic, numeric computa- 
tion is required. 

If a representing world is necessary, the 
design question for the artifact, then, is 
how to achieve the appropriate constraints 
in the representing world so that all possi- 
ble actions model meaningful actions in 
the represented world (and so that non- 
meaningful actions are not possible). 

Some artifacts, such as the nomogram, 
exploit the fact that they are physical as 
well as symbolic. The usual analysis of 
symbols ignores the fact that symbols are 
also physical things: The physical proper- 
ties are thought to be irrelevant to their 
symbolic, computational, or representa- 
tional properties. Certainly, the manipula- 
tion of the symbols has nothing to do with 
the constraints of the domain of interest. 
In fact, for many purposes the symbols 
need not be thought of as things at all. In- 
stead, they are treated as if they exist in 
their own realm, separated from the realm 
of "real" things. External representations of 
arithmetic are thought to be helpful only 
because they ease memory load, not be- 
cause there is anything in the physical 
structure of their external representation 
that is essential to the computation. 

The O-Symbol 

We now have on hand the conceptual 
pieces required to construct the new kind 
of artifact. First, select the objects that are 
to serve as the symbols in the system: Note 
that these are both objects and symbols — 
they are o-symbols (object-symbols) These 
o-symbols serve as the interface between 
the world of objects and the world of sym- 
bols, each being simultaneously in both 
the world of objects and of symbols. Now, 
in the world of objects, the items can ex- 
ploit the powers of forcing functions, us- 
ing physical constraints in such a way to 
force the symbolic interpretations to be 
meaningful. 
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The nomogram for computing boat speed 
(Figure 2) was constructed in this way. The 
physical constraints for the layout of the 
logarithmic scales (which are the symbolic 
representations for speed, distance, and 
time) make certain classes of syntactic er- 
rors impossible. 

Reinterpreting the Attraction of Direct 
Manipulation Interfaces 

We are now able to return to the problem 
area that triggered this investigation: the 
study of direct manipulation interfaces. 
Consider what one wants in an artifact: 
One wants structures that simplify the task 
for the person, that permit new computa- 
tions, or that produce new interpretations, 
all easier, faster, and more accurately or 
precisely than could be done without the 
artifact. And a ' "ajor enhancement of the 
artifact comes when it is so constructed that 
one can't do wrong, where the structures 
by their very construction are incapable of 
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FIGURE 4. Four modes of artifacts. 



producing syntactic violations. This, we be- 
lieve, is what is at the heart of the attrac- 
tiveness of direct manipulation interfaces 
to computers. 

True, DM interfaces have another attrac- 
tion: the naturalness of the mappings be- 
tween the represented and representing 
worlds. But this very naturalness also pro- 
vides an opportunity to be exploited by 
making the manipulations of the repre- 
senting world follow the physical con- 
straints of the represented world. This is 
one reason why the visible file folders met- 
aphor for organizing computer files can be 
so appealing. But it is not just any repre- 
senting world that works this way: DiM 
works when the physical constraints of 
manipulating the representation prevent it 
from representing anything nonmeaning- 
ful or impossible. 

We have now considered several modes 
of computation that turn out to fill the 
four spaces of a two-by-two array: One di- 
mension represents the kinds of con- 
straints — syntactic or physical; the other 
dimension represents the kind of action 
done by the artifact — passive or active. 
The combination gives the four computa- 
tional modes that are shown in Figure 4. 
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Appendix: Draft Checklist for the 
Design of an Artifact 

It is tempting to use the analyses of this re- 
port as design rules. This operation is pre- 
mature for the full understanding of cogni- 
tive artifacts and, in particular, the o- 
symbol, do not yet exist. Nonetheless, it is 
possible to provide a draft checklist of de- 
sign r ocedures that follow the principles 
studied in this research project. The princi- 
ples include those discussed in this report 
plus the analysis of human action into sev- 
en stages of action (Norman, 1986, 1988) and 
an emphasis on the proper role of con- 
straints, affordances, and mappings (Nor- 
man, 1988). This checklist needs elaboration 
and verification, but it illustrates how the 
work conducted under this research con- 
tract could be applied to the design setting. 

1. Do a task analysis. 
What needs to be done? 

What are the variables to be controlled, to 
be determined? 

What operations or actions need to be con- 
trolled? 

What feedback is needed to the user? 
What errors are possible? 

2. Do a system analysis. 

Where does the task fit in the general over- 
all picture? 

How do the various participants interact 
with one another? 

What are the hidden goals of the task? 

If the task or method of doing the task 
changes, what side-effects result in the 
rest of the organization? 

3. Select the o-symbols. 

4. Do the design. 

Develop the forcing function. 

Make operations visible. 

Make results visible. 

Get the affordances right. 

Follow the seven stages as design guides. 



5. Redo the task and system analysis. 
Are *11 meaningful operations possible? 
Are all nonmeaningful operations impos- 
sible? 

What kinds of social interactions are sup- 
ported? What kinds are made more dif- 
ficult? 

Does the user operate directly on the o- 
symbols? 

Are the affordances right— does the first- 
time or casual user need extra guid- 
ance? 

Is the feedback right? Can the user tell 

what has been done? 
How are errors: 

Discovered? 

Recovered from? 

Corrected? 

6. Watch typical users in a realistic situa- 

tion. 

Can they use the artifact appropriately? 
Do the o-symbols work properly — are they 

interpreted properly? 
Does the artifact fulfill the dosign goals? 
Do they wish to use the artifact in ways not 

fully supported, such as: 

For repeated operations, keeping some 
previous values? 

In a different order than contemplated? 

For a different purpose? 

Etc. 

And, if so, does the design support these 

needs? 
Do they make errors? 
Do they discover all their errors? 
Can they recover (gracefully) from error? 
Would they stop their current methods 

and switch to the artifact? 

7. Repeat design effort from 2, with empha- 

sis on all "no" answers, to 6. 
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